Toggle navigation
Home
About
About Journal
Historical Evolution
Indexed In
Awards
Reference Index
Editorial Board
Journal Online
Archive
Project Articles
Most Download Articles
Most Read Articles
Instruction
Contribution Column
Author Guidelines
Template
FAQ
Copyright Agreement
Expenses
Academic Integrity
Contact
Contact Us
Location Map
Subscription
Advertisement
中文
Journals
Publication Years
Keywords
Search within results
(((LI Xiaodong[Author]) AND 1[Journal]) AND year[Order])
AND
OR
NOT
Title
Author
Institution
Keyword
Abstract
PACS
DOI
Please wait a minute...
For Selected:
Download Citations
EndNote
Ris
BibTeX
Toggle Thumbnails
Select
Key information extraction algorithm of news Web pages
XIANG Jingjing, GENG Guanggang, LI Xiaodong
Journal of Computer Applications 2016, 36 (
8
): 2082-2086. DOI:
10.11772/j.issn.1001-9081.2016.08.2082
Abstract
(
632
)
PDF
(888KB)(
597
)
Knowledge map
Save
Since information extraction algorithm for Web pages lacks generality and information of title, release-time and source in news Web page, a new information extraction algorithm was proposed to resolve those problems. Firstly, HTML code of Web page was parsed to text sets combined with line number and text; then, extractor began to search boundary of news content from line which the longest sentence belonged to due to the characteristic that the longest sentence belongs to the content of news with an extremely high probability. Meanwhile, the longest common string algorithm was used to extract title, the regular expression and line number were used to extract release-time, and the presentation characteristics of source and line number were used to extract source. Finally, a data set was built to conduct a comparison experiment with an open-source software named newsPaper in accuracy of extraction. Experimental results show that newsExtractor outperforms newsPaper in average accuracy of content, title, release-time and source, it has strong generality and robustness.
Reference
|
Related Articles
|
Metrics
Select
Mobile terminal positioning method driven by road test data
YUAN Guangjie, LI Xiaodong, JIANG Zhaoyi, YUAN Peng, GUO Zhiwei
Journal of Computer Applications 2016, 36 (
12
): 3515-3520. DOI:
10.11772/j.issn.1001-9081.2016.12.3515
Abstract
(
883
)
PDF
(979KB)(
325
)
Knowledge map
Save
The current wireless positioning technology can not adapt to complex environment and has low positioning accuracy. In order to solve the problems, a mobile terminal positioning method driven by road test data was proposed. Firstly, based on the location algorithm of base station and the description algorithm of base station signal coverage, the location-coverage model of base station base was established. By matching the initial parameters of the mobile terminal with the model base, the initial range of the mobile terminal was obtained. Secondly, the road classification database was established based on the extraction algorithm of road feature, and the wireless signal feature matching algorithm was used to match the road information of the mobile terminal. Finally, the model base of longitude-latitude and intensity mapping was established and the precise position of the mobile terminal was determined by using the terminal signal comparison algorithm. The theoretical analysis and experimental results show that the probability of 2 m localization accuracy of the base station reaches 60%, the probability of 3 m reaches 77%, which are improved respectively by about 39% and 12% than those before whitening, and the description algorithm of base station signal coverage can also describe the coverage of base station signal more accurately. The accuracy improvement of the two parts can improve the final positioning accuracy.
Reference
|
Related Articles
|
Metrics